TALKING FOREIGN - concatenative speech synthesis and the language barrier
نویسنده
چکیده
This paper presents some solutions to the problem of synthesising multi-lingual speech using waveformconcatenation speech synthesis. The paper presents novel methods for deriving appropriate pronunciations for foreign words in a predominantly native-language text by use of multi-speaker synthesis, and methods for mapping the pronunciations of a foreign-language speaker onto the sounds available in the speech corpus of a native speaker so that the resulting synthesis produces speech which accurately represents the foreign words. The methods differ depending on the language-pair and on the direction of the mapping, because in the case of one-to-many phonemic mappings, high-level features can be used, but in the many-to-one case, a physical representation of the speech signal is required so that use can be made of the natural variability in speech production to select the most appropriate allophonic variant. All mappings are automatic, and the use of rule-based procedures which require human knowledge is minimised. In this way, the methods are extensible to any language combinations. Synthesised speech samples are included with the paper so that subjective evaluation of the results can be made.
منابع مشابه
Multi-lingual concatenative speech synthesis
This paper describes a method of concatenative speech synthesis that makes use of 3-dimensional labelling of speech, and shows how this can be applied to the synthesis of both mono-lingual and foreign-language speech. The dimensions encode phonetic, prosodic, and voicequality information in order to fully describe the acoustic characteristics of each speech segment.
متن کاملForeign-language Speech Synthesis
This paper describes a method of concatenative speech synthesis for producing speech in a language other than that of the database speaker. In certain applications, such as interpreted dialogues or multi-lingual e-mail, it is necessary to synthesise words that are foreign with respect to the language of the main text. In this case, rather than switch voices, we show that the use of an intermedi...
متن کاملIntroduction to multilingual corpus-based concatenative speech synthesis
This tutorial paper addresses foreign-language support in corpus-based concatenative text-to-speech systems. We give an overview of application domains where strictly monolingual speech synthesis is not sufficient and where multilingual text-to-speech is required or highly desirable. We describe two approaches to multilingual corpus-based speech synthesis: phoneme mapping on the one hand, and t...
متن کاملDesign of English to Hindi Corpus Based Text Conversion and Hindi Text to Speech Synthesis
English is a global language but is understood by few percentage of population in India. It continues to remain a barrier for rural population to learn and compete at a global level. Machine translation helps people from different places to understand an unknown language without the aid of human translator. A Text to Speech system generatesspeech from text given as input. The proposed system wi...
متن کاملمراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی
Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...
متن کامل